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Both cellular automata (CA) and lattice-gas automata (LG) provide finite algoritlimic presentations for 
certain classes of infinite dynamical systems studied by symbolic dynamics; it is customary to use the term 
'cellular automaton' or 'lattice gas' for the dynamic system itself as well as for its presentation. The two kinds 
of presentation share many traits but also display profound differences on issues ranging from decidability 
to modeling convenience and physical implementability. 

Following a conjecture by Toffoli and Margolus, it had been proved by Kari (and by Durand-Lose for more 
than two dimensions) that any invertible CA can be rewritten as an LG (with a possibly much more complex 
"unit cell"). But until now it was not known whether this is possible in general for noninvertible CA — which 
comprise "almost all" CA and represent the bulk of examples in theory and applications. Even circumstantial 
evidence — whether in favor or against — was lacking. 

Here, for noninvertible CA, (a) we prove that an LG presentation is out of the question for the vanishingly 
small class of surjective ones. We then turn our attention to all the rest — noninvertible and nonsurjective — 
which comprise all the typical ones, including Conway's 'Game of Life'. For these (b) we prove by explicit 
construction that all the one- dimensional ones are representable as LG, and (c) we present and motivate the 
conjecture that this result extends to any number of dimensions. 

The tradeoff between dissipation rate and structural complexity implied by the above results have compelling 
implications for the thermodynamics of computation at a microscopic scale. 
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I do not know of any single instance where something useful 
for the work on lattice gases has been borrowed from the 
cellular automata held. . . . Lattice gases differ in essence 
from cellular automata. A confusion of the two fields dis- 
torts our thinking, hides the special properties of lattice 
gases, and makes it harder to develop a good intuition. 
(Michel Henonjl^, with specific reference to Wolfram [42)1 



1 Introduction 

Cellular automata (CA) provide a quick modeling route to 
phenomenological aspects of nature — especially the emer- 
gence of complex behavior in dissipative systems. But 
lattice-gas automata (LG) are unmatched as a source of 
fine-grained models of fundamental aspects of physics, es- 
pecially for expressing the dynamics of conservativeQ sys- 
tems. 

In the quote at the beginning of this paper, one may well 
sympathize with Henon's annoyance: it turns out that dy- 
namical behavior that is synthesized with the utmost nat- 
uralness when using lattice gases as a "programming lan- 
guage" become perversely hard to express in the cellular 
automata language. Yet, Henon's are visceral feelings, not 

dynamics is called 'conservative' if it is the manifest expression 
of an invertible microscopic mechanism. It is called 'dissipative' if 
the underlying mechanism is not invertible to begin with, or if its 
invertibility is de facto irrelevant because one is not capable or willing 
to maintain a strict accounting of microscopic states — perhaps owing 
to lack of precise knowledge of the initial state and the evolution laws, 
impredictable influences on the part of the environment, or the sheer 
size of the task. 



argued conclusions. With as much irritation one could re- 
tort, "How can lattice gases differ 'in essence' from cellular 
automata if they are merely a subset of them? What are 
these CA legacies that may 'distort our thinking' and 'hide 
the special properties of lattice gases' ? And aren't there dy- 
namical systems that are much more naturally and easily 
modeled as cellular automata?" 

Today, with the benefit of twenty years' hindsight — and 
especially after the results of the present paper — we are in a 
position to defuse the argument. Henon's appeal could less 
belligerantly be reworded as follows: "Even though CA and 
LG describe essentially the same class of objects, for sound 
pedagogical reasons it may be expedient to deal with them 
in separate chapters — or even in separate books for different 
audiences and applications. What is ox in the stable may 
well be beef on the table." The bottom-line message is 
that these two modeling approaches do not reflect mutually 
exclusive strategies, but just opposite tradeoffs between the 
structural complexity of a piece of computing machinery 
and its thermodynamic efficiency. 



2 Preview 

Let C and L be the sets of dynamical systems representable, 
respectively, as cellular automata and lattice gases. Our 
overall question is, How are these two sets related? On 
one hand, we shall see that any lattice gas can be trivially 
rewritten as a cellular automaton — hence L C C. As for the 
converse issue, i.e., how much of C is in L, let's recall that 
C is naturally partitioned into three classes which reflect 



fundamental categorical properties, namely, 

descriptive name categorical property 

invertible injective and surjective 

C° almost invertible noninjective but surjective 

C~ locally lossy noninjective, nonsurjective 

According to a conjecture made by one of us in 1990 and 
proved by Kari in 1996 for the ID and 2D cases and 
Durand-Lose in 2001 for the general case, the vanishingly 
small class C+ of invertible cellular automata is definitely 
"in." For the other two classes, evidence has been lacking 
either wajH — which is particularly irritating for such decep- 
tively simple CA as the one consisting merely of a row of 
2-input AND gates. 

Here we first dismiss, as definitely "out," the vanishingly 
small class C° of almost invertible CA — where, while the 
system as a whole loses in one step a nonzero amount of 
information, the amount of loss per site is nonetheless zero. 

What is left is the class C~ of locally lossy CA; these 
lose in one step a nonzero amount of information per site. 
This class comprises almost all cellular automata. Here 
we present a procedure for rewriting any one-dimensional 
CA into an LG (which will have a more complex unit 
cell, consisting of two layers and spanning more than one 
site of the original CA). We also show a procedure for a 
simple 2D example. Work is in progress on an analogous 
construction (with a number of layers that increases with 
the number of dimensions) hoped to be of general applica- 
bility. On the basis of these partial results, we propose the 
conjecture that — exception made for the paradoxical class 
C° already dismissed above — the family of dynamical sys- 
tems presentable as cellular automata and that presentable 
as lattice gases coincide. 

3 Background 

Symbolic dynamics studies a class of dynamical systems 
which display topological continuity (reflecting locality of 
interaction) and invariance under space and time transla- 
tion (the regularity of a "spacetime crystal" [39 ) — and can 
thus be viewed as discrete, locally-finite versions of the par- 
tial differential equations of field theories. For example, the 
local map of a simple one-dimensional cellular automaton, 
which is a a recurrence relation of the form 

is analogous to the well-known forward-time, forward-space 
finite-difference scheme [5^ p. 13] 

for the differential equation 

dq dq 
dt dx 

^Except for second-order cellular automata, already proved "in" 
by us in 2004 (see ^7.11 1. and a few other sporadic cases. Of course, 
for any CA that was obtained from an LG as per Theor. [T] an LG 
presentation is automatically available. 



(One must keep in mind that in a cellular automaton the 
state set for 5* is restricted to a finite alphabet A, while for 
a differential equation it ranges over the whole continuum 
]R. On the other hand, the local map / of a cellular automa- 
ton is an unrestricted function of its arguments, while the 
corresponding map g of the finite-difference scheme can only 
see its arguments through "small differences" — qx+i — qx, 
of the order of h, in space, and <?'+^ — g*, of the order of k, 
in time.) 

Though symbolic dynamics systems are generally infinite, 
both cellular automata and lattice-gas automata manage to 
provide hnite algorithmic presentations for certain classes 
of them. These two kinds of presentation share a number 
of traits but also display profound differences — on issues 
ranging from decidability to physical implementability — to 
the point that one might suspect that CA and LG specify 
quite different kinds of dynamical systems (see the Hcnon 
quote at the beginning of this paper). 

On the other hand, for a number of reasons, some empir- 
ical and some esthetical, we've long been entertaining the 
notion — the hope, if you wish, since the evidence was ex- 
tremely scant cither way — that in fact CA and LG provide 
just different presentations for the same kind of objects. In 
other words, letting C denote the set of dynamical systems 
representable as cellular automata and L the set of those 
representable as lattice gases, one may ask. How are C and 
L related? Are they disjoint, partially overlapping, or even 
coincident? If the latter were true, how come that LG pre- 
sentations of even the very simplest CA are so hard to come 
by? 
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Figure 1: A simple one-dimensional cellular automaton; this 
circuit can be thought of as a presentation of a dynamical system 
on the full shift [25]. For concreteness, assume that the site state 
alphabet is binary and the local transition function at each site, 
denoted by a shaded gate, is a logic AND; the gates are fed by 
fanout (or "signal replicator") nodes, denoted by dots. Is there 
an equivalent lattice gas? 

Take, for example, the simplest nontrivial CA, namely, 
a string of 2-input and gates fed by fanout nodes (Fig.[T]). 
Until now, no one had managed to exhibit a functionally 
equivalent LG or, contrariwise, prove that such an LG can- 
not exist! What's perhaps even more intriguing, this very 
question has never (as far as we know) been raised in the 
literature, even though the analogous problem for invertible 
CA had been stated and made the object of a conjecture 
fifteen years ago[33 and then positively solved in the en- 
suing decade [ini [S]; and similarly, though more recently, 
for second-order CA[35]. Apparently, everyone was just as 
clueless about the present simple and example as we our- 
selves were until yesterday. 

Before proceeding further it will be convenient to infor- 
mally recall the definition of 'lattice gas' and the structural 
difference between cellular automata and lattice gases. 
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Figure 2: Format of a simple one-dimensional lattice gas. Here 
we may imagine g to be an arbitrary function of two binary in- 
puts, yielding as a result two distinct binary outputs. No fanout 
of signals (cf. the fanout nodes of Fig.[T)| is permitted. The labels 
a and h distinguish the two inputs as well as the two outputs; 
they actually are labels for arcs rather than I/O ports. 

The simplest nontrivia0 LG have the format of Fig. [21 
Instead of having, as in a cellular automaton (Fig.[T]), a 
single output from each node and making as many copies 
of it as necessary to "fan it out" to the nodes that at the 
next time step will use it as an input, a lattice gas does 
not make use of signal fanout; instead, each node has as 
many outputs (these may have different values, not just 
copies of the same one) as it has inputs. More precisely, 
for a CA of state alphabet A and number of neighbors n 
the local map is of the form / : v4" A. In an LG, on 
the other hand, the state alphabet has the structure A — 
A1XA2X ■ ■ ■ xAn (the factor alphabets Ai, . . . , An need not 
have the same number of elements), and the local map has 
the form g : A1XA2X ■ ■ ■ xA„ — > A1XA2X ■ ■ ■ xA„, with 
no fanout required or permitted. 

As we shall see in Theor. [H lattice-gas automata can 
be thought of as a special case of cellular automata. 
They were used (without being given a special name) by 
[33l [34] and then extensively investigated by Margolus and 
TofFolipTl [551 [37]; one term used at that time was 'par- 
titioning cellular automata'. In 1985, a rudimentary CA 
model of fluid dynamics discovered independently by the 
latter research program turned out to be quite similar to 
one proposed a few years before by the hydrodynamicist 
Yves Pomeau and colleagues; in turn, this convergence soon 
stimulated a whole industry of "lattice-gas hydrodynamics" 
research (see [37| for references). At that time, since the 
term 'lattice gas' had already been used for sundry discrete 
models of fluid dynamics, it became customary to call those 
special cellular automata 'lattice-gas automata' or simply 
'lattice gases'; the term 'block cellular automata' is also in 
common use. The no-fanout constraint is used with the 
same meaning, though in an independent line of research, 
in Girard's linear logic 15 . There, as here, multiple uses 
require explicit duplication, with all the costs (in infras- 
tructure or running expenses) that that may entail (see also 
0). 

The prototypical event in a lattice gas is the collision of 
abstract tokens or "particles." In many practical cases this 
may involve a mere reshuffling of them. A generalization of 
this process is when what is permuted are not the particles 
but the states of the input tuple (a reshuffling of particle is 
then the special case of a permutation of the indices of this 
tuple); this "transmutation" may be visualized as the cre- 

''A CA is trivial if it has a neighborhood of size 1, and thus, up 
to a translation, the dynamics decomposes into identical independent 
dynamics for each site. 



ation, out of the collision of particles, of an "excited state" 
(corresponding to a lattice-gas node), which on decaying 
releases particles that may differ from the original ones. 

Note that in Fig.[Il as wefl as Fig.[21 the state of the 
system at time t is the infinite collection of signals crossing 
the dashed line t. The combinational-logic diagram drawn 
between the two dashed lines constitutes a presentation of 
the dynamical system, that is, a way to indicate by an ex- 
plicit, locally- finite algorithm the functional dependency of 
the state at time t+1 on that at time t. Namely, for the CA 
of Fig.lH using i as a sequential site index, the algorithm 
specifies that 

= AND(q*,g*+i); 

different presentations are of course possible for the same 
system. In the LG of Fig. [21 the state-component at each 
site consists of a pair of signals, labeled a and &, and the 
local algorithm specifies that 

a*+i=5a(a*,5*+i) and 6*+^ = g,(a*, 6*+i). 

Either in a CA or in an LG the presentation may consist of 
any finite number of combinational-logic layers, as in Fig. [51 
rather than a single layer as in the previous two figures, each 
layer obeying the respective discipline for a CA or an LG. 

Finally, two dynamical-system presentations are equiv- 
alent (or conjugate [25]) if the systems they describe are 
isomorphic — the presentations are "merely different views 
of the same underlying object." Note that, in rewriting a 
CA as an LG, one is not permitted to introduce supple- 
mentary variables, or 'ancillae', to be used as a scratchpad 
for intermediate computations (cf. |35] and [5]); one must 
make do with whatever state- variables are already available 
at each site. 

To anticipate m2[ we mention here that the above distinc- 
tion between CA and LG makes a vital difference when one 
turns to the physical implementation of an algorithm. Even 
in ideal conditions, a fanout node — which creates copies of 
a signal — needs by its very nature a source of energy to 
operate; morever, as Landauer argued in [21j . energy is 
then turned into heat when signals are erased, discarded, 
or destroyed — as in the and gates of Fig.[Tl which take in 
two binary lines but only put out one. In this sense, CA 
provide — for the same functionality — cheaper meclianisms 
than LGs: they are are easier to fabricate (as we shall see 
in a moment), but need a power source to operate. 

From the above definitions, one can see that 

Theorem 1 Any LG is immediately rewritable as a CA. 

Proof We use the LG of Fig. [51 as a specific example, 
but the construction method is fully general; starting with 
that LG, we gradually modify it into a CA. Represent the 
pair of signals (a, b) leaving or entering a node as a single 
signal q = (a, b) (if a e A and b e B, then q e AxB). With 
reference to Fig. [31 as soon a signal q at site i crosses the 
time line t we put it through a fanout element to obtain two 
copies of it, and feed one copy to the node (embodying a 
function g') at site i and the other to that at i — 1. A node re- 
ceives, through inputs qi = (a^, bi) and qt+i = (0^+1, 6i+i)), 
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one a input from site i + l and one b input from site i — ^just 
like a node g of Fig. [2] — as well as two extra inputs, namely, 
one b from site i + l and one a from i. We define g' as 
the function which ignores the latter two inputs (starred, 
in the figure) and otherwise acts identically to g. The re- 
sulting system is a CA isomorphic to the original LG. ■ 
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Figure 3: A CA presentation of the system described by the LG 
of Fig. [2] Both a and b, bundled together as q, are duplicated by 
the fanout nodes, but then one copy of each (starred) is ignored 
by g' , which in this way effectively acts just as g. 

The outcome of this is that, despite the difference in pre- 
sentation, the functional dependence of the global state at 
t + \ on that at t is the same in both Fig. [2] and Fig.[3l 
Clearly, transcribing from an LG format to a CA one is 
easy; how hard is the opposite direction? 

We shall now draw a rough chart of the area we intend 
to explore; this area need not extend beyond C since, by 
Theor. [2 all of L (whatever its extent may turn out to be) 
is contained in C. 



injective noninjective 
surjective 



non- 
surjective 
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^invertible 

almost invertible 



locally lossy 



(1) 



In diagram ([T]) the class C of symbolic dynamics systems 
presentable as cellular automata is shown partitioned into 
four subclasses according to whether or not a system is sur- 
jective (or "onto") and whether or not it is injective (or 
"one-to-one"). This classification, which is of a fundamen- 
tal category-theoretical nature, dates back to the earliest at- 
tempts to deal with cellular automata from a mathematical 
(rather than merely phenomenological) viewpoint [291 130j . 
The diagram's proportions remind one that surjective sys- 
tems represent a vanishing fraction of the entire class, or a 
subset of measure zero — in the sense that, as one increases 
a cellular automaton's complexity in terms of size of the 
state alphabet and of the neighborhood, the fraction of cel- 
lular automata that are surjective goes to zero; similarly for 
injective systems. 

It turns out that there are no cellular automata that are 
injective but not surjective[3T]; we indicate this by graying 
out the corresponding area, which henceforth will no longer 
be of concern to us. Thus, injective CA make up a vanishing 



subset not only of the whole set but also of even just the 
surjective ones. 

We shall denote by C+ the minuscule class of systems 
that are both surjective and injective, called invertible (or, 
in physics, "microscopically reversible"). As we shall re- 
late in ^ any invertible cellular automaton can indeed be 
rewritten as a lattice gas; thus all of C"*" is in L. 

In 3S]we discuss the class C° of almost invertible cellular 
automata — those that are surjective but not injective — and 
we prove that none of C° is in L. 

What is left is the class of locally lossy cellular 
automata — that is, those that, not being especially marked 
as invertible or surjective, are (as one used to say) in "gen- 
eral position" (we'll give a more concrete characterization 
of them in a moment). These, which make up the bulk 
of all cellular automata, are this paper's main concern. 
Only a few are known to be presentable as lattice gases. 
Here we show, by means of an explicit construction, that 
all one-dimensional cellular automata of this class are so 
presentable, and indicate how this construction might be 
extended to multidimensional systems. On the basis of this 
evidence, we conjecture that any cellular automaton, with 
the exception of those of class C°, can be rewritten as a (pos- 
sibly much more complicated) lattice gas; in other words, 
that L = C+ + (cf. diagram (P)). 

In brief, concerning what parts of C are in L, 

• C+ is "in" (proved by [HIH]). 

• C° is "out" (proved here). 

• is "in" (proved here for one dimension; conjectured 
here for the general case, with an actual working 2D 
example and a plausible direction for a proof). 

4 Local recognition criteria for C+ and 

C 

The categorical characterization of an invertible CA accord- 
ing to diagram ([T]) — that it be surjective and injective, that 
is to say, that every global state (or configuration) must 
have one and only one predecessor — is rather impractical, 
since configurations are countably infinite objects and their 
number is uncountably infinite. 

In [37j we had provided a practical local criterion for 
recognizing invertibility — one that involved only local maps 
applied to a finite number of sites; namely, 

Lemma 1 There is an effective procedure for deciding, for 
any two local maps f and /', whether the corresponding 
global maps F and F' are inverses of one another. 

In a similar vein, our term "locally lossy" for a noninjec- 
tive, nonsurjective CA reflects the following practical local 
criterion (Lemma [2]), which is the only one that we'll have 
a need for in this paper, and which likewise involves only a 
CA's local map applied to a finite number of sites. 

In both cases, we feel that the reader may gain more 
immediate access to the arguments and constructions dis- 
cussed below by taking the local recognition criterion as the 
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very deRnition of the kind of system in question, and the 
categorical properties as an equivalent, more elegant but 
more remote, characterization. 

We recall that the neighbors of a site i are those sites from 
which the local map / takes its arguments at time t in order 
to compute i's new value for time t + 1. The neighborhooc^ 
X is a function which, when applied to a site i, yields the 
site's neighbors i+jn, ■ ■ ■ ^i+jh] it can thus be thought of as 
the collection joi ■ ■ ■ ,jh of these offsets. (In d dimensions, 
sites i and offsets j are represented by d-tuples.) 

We shall call patch any collection of sites, and pattern the 
collective state of a patch. If / is a patch, X{I) is defined in 
the obvious way as the union of the neighborhoods X{i) for 
all sites i of the patch. When X is understood we write / for 
X{I) and call it the (causal) closure of I — as it consists of 
all those sites the knowledge of whose state may be needed! 
in order to determine the new state of /. Just as / maps 
the state of the neighborhood i of a site i to that of the 
site itself, so does the induced function /(^' : , 
which is but a spatial iterate (a convolution) of /, map the 
state of a patch's I closure, /, to the state of the patch 
itself. When the patch / is understood, there will be no 
need to distinguish between / and f^-^\ The global map F 
of a cellular automaton of local map / is /^^ where Z"* 
represents the entire infinite rf-dimensional array of sites. If 
p is a pattern on a patch /, then / is the support of that 
pattern. A pattern P that has no predecessors, i.e., for 
which the equation f{P') = P has no solutions, is called an 
orphan pattern. 

We define a cellular automaton locally lossy if it has finite 
orphan patterns, that is, if for some finite set of sites / the 
application of / to / fails to yield all possible states for / 
itself. 

Lemma 2 A cellular automaton is locally lossy iff it is not 
surjective. 

Proof. If in a CA there is a finite patch / that has an 
orphan pattern, then any configuration containing this pat- 
tern is itself an orphan — and thus the CA is not surjective. 

To prove the converse, i.e., that the CA is surjective if 
there are no finite orphan patterns, assume that the CA 
is not surjective, and thus has a configuration C with no 
predecessors under the global map F. Take a sequence 
Co,Ci, . . . of finite patterns each containing the preceding 
one, all agreeing with C on their respective supports, and 
such that these supports cover the whole space of sites. If 
there are no finite orphans, each pattern Cj will have, on the 
closure of its support, at least one predecessor that we'll call 
Bj . If we extend in any manner Bj to the entire space we get 
a configuration, Kj. The sequence Kq, Ki, . . . being defined 
on a compact spacej^ has at least one accumulation point K , 
and from it one can extract a subsequence Kj^ , Kj-^ , . . . that 
has K as its limit. By construction, F{Kj^) coincides with 

*Also called neighborhood function, neighborhood index, or neigh- 
borhood template. 

^At least for some / having that neighborhood. 

^The configurations set of a CA with the natural product topology 
is compact by Tychonoff's theorem. Moreover, the global map of a 
CA is by construction continuous with respect to this topology. 



C on the support of Cj^; by continuity, F{K) = C, which 
contradict the absurdum hypothesis that C had no prede- 
cessors. (This proof is patterned after one by Fiorenzi |13l 
Prop. 3.2.7].) ■ 

See Footnote [6] for ties to other equivalent characteriza- 
tions. 

We use decidable as a synonym for 'recursive', and 
semidecidable for 'recursively enumerable'. A consequence 
of Lemma [T] is that the class C"*" is semidecidable. Similarly, 
Lemma [2] implies that the class as well is semidecidable. 
From this, it follows that the class C° itself must be not 
only undecidable, but also not even semidecidable. In fact, 
if C° were semidecidable, all three classes C+, C°, and C~, 
which make up a partition of the universe C, would be de- 
cidable. But C+ had been proven undecidable by Kari[T8]. 
which leads to a contradiction! 

We shall call regular a cellular automaton for which ei- 
ther invertibility or noninvertibility is eventually recogniz- 
able by local means (cf. and thus one of class C+ or C", 
and singular one that is not recognizable in that way, and 
thus of class C°. The results of this paper will make that 
terminology particularly suggestive (see Proposition [2]) . 

5 Invertible cellular automata 

The state of the art in invertible cellular automata was re- 
viewed in [37]; to avoid much repetition, we assume some 
familiarity with that material and with Kari's follow-up [20] . 
In the first of those two papers, one of us conjectured 
that all invertible cellular automata can be rewritten as 
lattice gases. This conjecture was subsequently proved, 
by Kariji)] for two-dimensional cellular automata and by 
Durand-Lose[8] for the general case, in spite of the unde- 
cidability of invertibility itselflll This undecidability nev- 
ertheless extracted a steep price; namely, there can be no 
computable upper bound to the increase in complexity (in 
terms of size of the state alphabet and of the neighborhood) 
in going from a cellular automaton to a functionally equiva- 
lent lattice gas. What this means in practice is that, though 
one can always emulate an invertible cellular automaton by 
a lattice gas, the latter may need arbitrarily more complex 
machinery per site. 

A strictly analogous undecidability issue applies to the 
present quest for lattice-gas counterparts of locally lossy 
cellular automata, and consequently we must expect anal- 
ogous complexity tradeoffs. 

6 Almost invertible cellular automata 

Here we use, for the class C° of surjective but not invert- 
ible cellular automata, the mnemonic almost invertible. In 
fact, invertible cellular automata (class C+) do not lose in- 
formation at all, while locally lossy cellular automata (class 
C~) lose in one step a nonzero amount of information per 

'^The invertibility of a dynamical system on the basis of 
its presentation as a cellular automaton is undecidable — though 
semidecidable (37) . 
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site. On the other hand, the automata of class C°, while not 
information-lossless, lose in one step such a small amount 
of information over the entire infinite array of sites that the 
loss per site is zero just as in the invertible caself] 

The simplest example is obtained by replacing the and 
gates of Fig.[l] with XOR gates. In that case every infinite 
configuration has exactly two predecessors, as in 

■•■000001111---\ 

; • • ■ 000010000 • • • 
■•■lllllOOOO---/ 

and thus the dynamics loses just one bit of information per 
time step over the entire array. 

Our question in this section is what part of C° is in L, 
that is, which almost invertible cellular automata can be 
rewritten as lattice gases. 

For the purposes of mathematical analysis, a chief advan- 
tage of a lattice-gas presentation of a dynamical system over 
a cellular-automaton presentation is that, with the former, 
the system directly inherits certain categorical properties 
that can be ascertained, by mere inspection, on the presen- 
tation itself. Namely, 

Lemma 3 If a shift dynamical system admits of a lattice- 
gas presentation, then it is injective (resp. surjective) if and 
only if every node of the lattice gas is. 

Proof. First let us observe that if S' is a Cartesian 
product of finite sets and is a componentwise map on 
S, that is, S — Ylifzj Si and (F(s))j = fii-Si), then F is 
surjective iff each of the fi is surjective. In fact, if fi{Si) = 
Si for all i, then F{S) = S; if, on the contrary, Xi G Si \ 
fi{Si) and {s)i — Xi, then s £ S\F{S). This is also true for 
injectivity: there can be two s's with the same image under 
F if and only if there are two s^'s with the same image 
under fi for some i. 

We've seen in ^ that, in a lattice gas, the state al- 
phabet A is the Cartesian product of n factor alphabets, 
^1 X • • • xAn. In turn, the state set for the entire system 
is — A^ , where d is the number of dimensions of the site 
array. Using Fig. [21 where d = 1 and n = 2, for sake of 
illustration, we can write S as 

S — --- X A1XA2 X A1XA2 X A1XA2 X ■■■ 

X~l X x+1 

where x is the site index. Even though it is true that the lo- 
cal map g maps A1XA2 into A1XA2, one may observe that, 
while output 1 of site x is used as input 1 of the same site, 
output 2 of site x is used as the input 2 of the preceding 
site, X — 1, and so forth. In general, the signal transport 
operation denoted by the arrows performs a reshuffling of 
homologous state components between neighboring sites. 
This local coordinate reshuffling, from which there ensues 

*This is just a more intuitive way of stating Moore and My- 
hill's classic theorem |29l 1301 . namely, that (in Maruoka and Kimura's 
terms [28]), a cellular automaton's global map is surjective if it admits 
of no erasable patterns. See Calude's section "Randomness in cellular 
automata" [61 9.5] for a review and consolidation of these concepts. 



at the global level a mere state reshuffling within S, is obvi- 
ously invertible, and thus immaterial for the sake of injec- 
tivity and surjectivity. The thesis follows from the previous 
observation. ■ 

Now, for a finite set — like the state alphabet A of a lattice 
gas — it is evident by the pigeonhole principle that a trans- 
formation / on it cannot fail to be injective without failing 
to be surjective as well, and vice versa; that is, if there is a 
point of A that is not in the image of /, then there must be 
some other point that has two counterimages. Thus a lattice 
gas can only be either both injective and surjective — that 
is, invertible — or neither — that is, locally lossy. We thus 
have that L C C+ U C", or 

Theorem 2 JVo almost invertible cellular automaton can be 
rewritten as a lattice gas. 

7 Locally lossy cellular automata 

All that is left at this point is determine which locally lossy 
cellular automata (these, as we've remarked, comprise vir- 
tually all cellular automata) can be rewritten as lattice 
gases. There is little in the literature to guide one in this 
task. 

7.1 Second-order cellular automata 

Indeed the only systematic result concerns second-order cel- 
lular automata, where the new state of a site x at time t-\-l 
depend not only on the state at time t of a number of neigh- 
boring sites but also on the state at time t~loix itself, as 
in Fig.[4l Of course, these can be rewritten as ordinary (i.e., 
first-order) cellular automata by means of a transformation 
analogous to the Legendre transform of classical mechan- 
ics: the second-order "Lagrangian" system with state set 
Q (the solid lines traversing time t in the figure) is viewed 
as a first-order "Hamiltonian" system with state set PxQ 
(the Q component is denoted by the dotted lines as they 
traverse time t). 

In our 2004 paper [35] we showed that all second-order 
cellular automata, invertible or not, can be rewritten as 
lattice gases. For example, the cellular automaton of Fig.[4| 
can be rewritten as a lattice gas having the format of Fig.[5l 
There, each of the shaded blocks is a very simple 8-input, 
8-output logic function (most signals happen to go through 
it unchanged) , and the overall complexity of the machinery 
per site and time step is not much larger than that of the 
original cellular automaton. Note that each lattice-gas node 
straddles four of the original cellular-automaton sites, and 
it takes four layers of nodes to process one time step. Thus, 
the structure of the LG repeats itself only every four sites 
and every four layers. However, the function computed by 
the lattice gas is translation invariant for shifts of a single 
site, just as the original cellular automaton. That is, we 
achieve a dynamical law having a given spatial regularity 
in spite of using a mechanism whose regularity pitch is four 
times as coarse! 

The straddling of several sites and the attendant coars- 
ening of the structure's translation group is a general fea- 
ture of lattice-gas presentations vis-a-vis cellular automata 
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t+1 




t-1 



Figure 4: A second-order cellular automaton. This example is 
in one dimension and has three neighbors per site ("left," "cen- 
ter," and "right of right"), besides an input from the previous 
time, indicated by a dotted line. Alternatively, the dotted signal 
can be thought of as a second state-component, and the system 
viewed as a first-order one with two coupled equations, like a 
Hamiltonian system. 



t + 1 - A^ 
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J c 



J c 



J c 



J c 



J c 



J c 



H-r--| 



) c 



J c 



J c 



) c 



t-l qPr-Vr-H-r-H-rH-hH-r-H-rH- 



x—2 x—1 



x+1 x+2 



Figure 5: Overall structure of a lattice gas representing a system 
isomorphic to that of Fig.|4l Note that each lattice gas node 
straddles four sites, and that it takes four layers of nodes to 
process one time step. 



"and" cellular automaton of !J3] and attempt to transform 
it into a lattice gas. A naive approach would be to cut 
it up into segments of, say, four sites each, and turn each 
segment into a lattice-gas node, as in Fig.O In this way, 
the neighbor links that used to connect adjoining segments 
are severed^ neighbor data coming out of the left of each 
segment are lost, while the right input to the rightmost 
gate of each segment remains unspecified. We may force 
this input to a constant value — say, 1. 



t+1 




JL 





t 



Figure 6: Here the cellular automaton of Fig. [T] has been sliced 
into four-site segments, and each segment lumped into a lattice- 
gas node. 

We have now obtained a lattice gas with 4- input, 4-output 
nodes each straddling four of the cellular automaton sites. 
The picture inside each node — the and gate and the fanout 
junction — is merely a reminder of what function the node 
is supposed to compute, not a representation of its internal 
structure. In fact a lattice-gas node, like any logic gate, 
has no internal structure at all associated with it — only an 
assigned correspondence (one may think of it as a lookup 
table) between input and output states. Specifically, no 
actual fanout (in some presumed internal mechanism?) is 
implied by the use of fanout icons in the picture. 

Moreover, the data transformation performed by this lat- 
tice gas from time i to i + 1 is clearly identical to that per- 
formed by the original cellular automaton — except for each 
node's rightmost output (marked with a star), which just 
echos the input from the corresponding site without and ing 
it with its right neighbor. Our next task will be to fix this 
problem without giving up the lattice-gas format. 



presentations — it is, as it were, the price one has to pay to 
be able to sense many neighbors without making recourse to 
signal fanout. In the case of second-order cellular automata, 
the pitch ratio (4, in the present example) coincides with the 
span of the neighborhood^ in general, though (and, specif- 
ically, in the case of invertible cellular automata), the pitch 
ratio is an unbounded function of the cellular automaton's 
complexity, owing to the undecidability mentioned in fJH 
Thus one must expect to find cellular automata for which 
the simplest lattice-gas counterpart, if one exists, has an 
arbitrarily larger and more complex "unit cell" (cf. [39]) 
than the original cellular automaton. We shall see that this 
is also the case with locally lossy cellular automata, and 
touch with hand the concrete reason why. 

7.2 When information loss helps 

We shall now sketch in an intuitive way the argument on 
which our proof and conjecture rest. Let us go back to the 



t+1 




t 



Figure 7: Adding a second layer of nodes to bridge the gaps left 
from the first layer (cf. Fig. [6]). Unfortunately, the lone AND gate 
on the second layer gets its right-neighbor signal already as it 
will be at time t + 1, rather than as it was at time t. 

In order to avoid "lateral" transmission of information 
within the same layer (which is against the rules for an LG — 
and indeed it would create a path for instantaneous trans- 



^In more than one dimension, this "ratio" will be a vector with 
one component for each dimension. 



^'^By definition, a distributed dynamical system cannot have "lat- 
eral" transmission of information — that is, between nodes that belong 
to the same time slice — because that would introduce infinite regress 
("infinite speed of propagation of information") and thus leave the 
global state under- or over-determined. 
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mission of information across infinite distances) we may try 
to add a second layer to the lattice gas, with the nodes 
of this layer bridging the gaps left from the first layer, as 
in Fig. [71 In this way, the lone and gate on the second 
layer does get, as intended, a right-neighbor signal as well 
as a center signal. However, while the center signal is that 
of time t, as desired, the right-neighbor signal has already 
been processed by the first layer and thus has the state it 
will display at time t+1. We do get an LG, it is true, but 
still one with the wrong dynamics! 



t+1 




t 



Figure 8: Here we tap the state of the a line at a' — ahead of 
first-layer processing — and forward it (dotted line) directly to 
the second layer. 

To obviate the above problem, we could get the right- 
input to the second-layer gate by tapping line a ahead of 
first-layer processing, at a', so as to forward a copy of this as 
yet unprocessed signal (dotted Hue) directly to the second 
layer, as in Fig. [51 In this way we certainly obtain the right 
dynamics-that of Fig.[l] — between t and t + 1, but the in- 
tervening mechanism is no longer a lattice gas! In fact, the 
first-layer nodes have four binary input lines but five output 
lines — and vice versa for the second-layer nodes — while we 
had postulated (in fJT]) that lattice-gas nodes should have 
equal input and output state sets. Intuitively, to perform a 
transformation involving four state variables, we have taken 
the liberty here to introduce ex nihilo a fifth state variable, 
for intermediate storage. From a physical viewpoint, the 
dynamics depends on external degrees of freedom and is no 
longer self-contained. 

Fortunately, there is one feature that hasn't yet been 
brought into play; namely, that we are dealing with the class 
of nonsurjective cellular automata, which lose information 
on a local basis — that is, a nonzero amount per site. Specifi- 
cally, even as the input lines at time t range over all possible 
combinations of binary states, not all eight possible states 
for lines a, 6, c will actually be produced; in fact, one may 
verify that, with the given dynamics, state fOf will never 
appear at the output. In other words, in going from time t 
to t + f with the and rule, the 3-bit "channel" (a, b, c) is not 
utilized at full capacity. Can one use the spare capacity to 
transport some unrelated information? For instance, that 
now carried by the dotted line in Fig.[5p 

To this purpose, let us set up a two-layer lattice gas just 
as in Fig. [51 but with a fc-site wide channel, with k to be 
chosen large enough to provided the required capacity. As 
shown in Fig.[9l in the first layer a (fc-l-I)-to-fc encoder (j) 
will squeeze the extra signal (dotted line) together with the 



k underutilized signal lines into k better utilized lines; con- 
versely, in the second layer a fc-to-(fc-l-I) decoder will 
separate the extra signal from the other k signals and de- 
liver it to its destination gate. Such a coder/decoder pair 
is termed, in the trade, a codec. 



t+1 




k+l 

Figure 9: To avoid running an additional signal line between 
first and second layer, in the first layer we use an encoder (j) to 
squeeze the dotted signal into the underutilized channel provided 
by the next k sites, and in the second layer a decoder 0"^ to get 
this signal back out. 

What is the minimum codec width k for the given AND- 
rule CA? For a first rough estimate, let's observe that by 
taking one step of the AND rule starting from the maxi- 
mally random ensemble Uq of configurations, which has an 
entropy of one bit per site, we obtain an ensemble Ui where 
at any site a I will appear with a probability of 1/4. The 
entropy per site 77 of this new ensemble will be no more 
than l/41g2 4 + 3/41g2 4/3, or « .811 (row 1 of Tablel- 
and possibly somewhat less because of local correlations. 
In fact, a tabulation of 77 for blocks of increasing length n 
allows us to estimate an actual entropy density of approxi- 
mately .7 bits/site, and thus an entropy defect of about .3 
bits/site. This is the spare "channel capacity" per site of 
the AND CA thought of as an infinite array of binary lines 
filled with a noise distribution Si . 

This very crude level of analysis tells us that the width 
of the encoder may have to be of the order of A; = 4 — since 
with a .3 bit/site capacity it would take at least 4 sites 
to squeeze through one extra bit. A more precise analysis 
must take into consideration two additional factors: 

1. We want a finite, deterministic encoder, which cannot 
tolerate errors and yet cannot, as in Shannon's theory, 
fix errors by sharing resources with adjacent blocks. In 
our case, a more appropriate quantity than the entropy 
H ol a random variable is the variety y of a function. 
Intuitively, variety corresponds to the number of dis- 
tinct values that the function takes over its domain. 
Like entropy, variety is customarily expressed on a log- 
arithmic scale [2] in an appropriate base (we shall be 
using base 2 for the moment). The (logarithmic) vari- 
ety V{y) of the output /c-tuple y = (a, 6, ... , z), when 
the A: -I- 1 inputs to a block at time t range over all their 
possible 2''+^ values, is given by column V of Table[Tl 
We see that only for fc = 8 does the variety of the "new 
state" fall below 7 (the boldface value 6.83 on row 8), 
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thus leaving enough space for one extra bit (for exam- 
ple, a') to be squeezed by the encoder into the same k 
lines. 

2. On the other hand, a' is in general to some extent 
correlated with the y fc-tuple, since a' is part of y's 
"cause." Thus we may expect that a' may contribute 
less that one bit of additional variety to the (fc + 1)- 
tuple (a' , a,b, . . . , z). In fact, it turns out that in our 
case the variety V of the {k + l)-tuple (a', a,b, . . . ,z) 
falls to no more than k already for fc = 4 (boldface 
value 4 for V' on row 4). That is the threshold we 
were looking for! 



k 


H 


V 


V 


V 




1 


0.81 


.811 


1.00 


1.58 


2 


2 


1.55 


.774 


2.00 


2.32 


3 


3 


2.25 


.750 


2.81 


3.17 


4 


4 


2.95 


.737 


3.58 


4.00 


5 


5 


3.65 


.730 


4.39 


4.81 


6 


6 


4.35 


.725 


5.21 


5.61 


7 


7 


5.05 


.721 


6.02 


6.43 


8 


8 


5.75 


.718 


6.83 


7.24 


9 


9 


6.45 


.716 


7.64 


8.05 


10 


10 


7.15 


.715 


8.46 


8.86 


11 


11 


7.84 


.713 


9.27 


9.67 


12 


12 


8.54 


.712 


10.08 


10.48 


13 


13 


9.24 


.711 


10.89 


11.30 


14 


14 


9.94 


.710 


11.70 


12.11 


15 


15 


10.64 


.709 


12.51 


12.92 


16 


16 


11.34 


.709 


13.32 


13.73 


17 



Table 1: Total entropy H, entropy density (per site) rj, and 
variety per site V of the output pattern of width k produced 
in one step by the "and" cellular automaton of Fig.[T] from a 
meiximally-random input neighborhood of width fc + 1. The last 
column, V' , gives the variety when the output fc-tuple is aug- 
mented by the input signal a' (see Fig.j^. Note that, because of 
internal correlations, this extra binary line only adds about 1/2 
bit of variety. 

We conclude that, for the CA in question, an equivalent 
LG presentation with two layers of blocks of width n = k + 1 
and an appropriate codec of width k is possible as soon as 
fc > 4. 

8 The one-dimensional case in general 

The foregoing construction was given for a simplest non- 
trivial CA (cf. fj3]), which (a) is one-dimensional, (b) uses 
a 2-state alphabet (the Boolean values and 1), (c) has 
a 2-element neighborhood template (consisting of the two 
offsets and 1), and (d) uses the and function as a local 
map on this neighborhood. 

Here, while remaining within one dimension, we shall gen- 
eralize that construction to any state alphabet A, neighbor- 
hood X, and local map f. In subsequent sections we shall 
pursue the most general goal along these lines, i.e., a con- 
struction method that applies to any number of dimensions. 

In one dimension, the diameter of the neighborhood is 
the distance between leftmost and rightmost neighbors, or 
jh —jo- (In the AND CA used in §7.2l the offsets were and 1 



and thus the neighborhood diameter was /i = l — 0=1.) A 
neighborhood may be sparse, in the sense that some offsets 
between the lowest and the highest are missing; the diam- 
eter is not affected by sparseness. Henceforward, without 
loss of generality, we shall assume that the neighborhood 
is not sparse (gaps may always be filled by dummy neigh- 
bors), and shall add a constant to all offsets so as to make 
the least offset 0. With this convention, the neighborhood 
diameter coincides with the highest offset h, where -I- 1 is 
the number of neighbors. 

The construction template we propose below for the gen- 
eral one-dimensional case, illustrated by the sequence of 
steps of Fig.[Tni has two adjustable parameters. The first is 
the neighborhood diameter h; the second, the codec size k 
(i.e., the number of sites spanned by it). The corresponding 
block size will he n = k + h. 




h k — h 



Figure 10: General template for converting into a two-layer lat- 
tice gas an arbitrary one-dimensional locally lossy cellular au- 
tomaton. The dome-like "gate" denotes the local map /. The 
parameter h is the neighborhood diameter; k is the size of the 
codec, and n = k + h the block size. Note how, in the second 
layer, neighbor inputs other than the neighbor are not taken 
from the new-state lines but from the corresponding old-state 
lines, which had been store-and-forwarded from the first layer. 
Notation conventions are explained below. 

In Fig.[Tn]we have used two conventions to avoid clutter. 
First of all, instead of having h + I neighbor wires con- 
verge onto a gate as inputs (as, for instance, in Fig. 2]), we 
have retained a separate wire only for the neighbor of offset 
0; all other h neighbors are picked up by a side extension 
of the gate itself which senses them but also lets them go 
through, so that they may be sensed in turn by all the gates 
of which they are neighbors. We thus have a notation sug- 
gestive of a cross-point matrix connecting vertical "source" 
wires to horizontal "destination" wires. This notation is 
standard in reversible logic 35 and quantum computing^]. 
Secondly, instead of indicating an indeterminate number of 
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repetitions of a certain feature (such as the circuitry of one 
site) by explicitly giving the first and the last occurrence of 
the feature and implying all the intermediate occurrences 
by ellipsis dots (". . . "), as in Fig. [51 we imply the interme- 
diate occurrences by a single grayed-out icon, as well as a 
brace indicating how many occurrences are intended in all. 
Thus, the h neighbors of a site (not including the site it- 
self, or neighbor 0) are indicated by an icon for neighbor 
1, one for the last neighbor h, and a single grayed-out icon 
standing for the remaining h ~ 2 neighbors. 

For the construction to work as intended, the k lines run- 
ning from the encoder </> in the first layer to the decoder 
0"^ in the second must be able to carry, distributed among 
themselves, not only the corresponding k new states just 
computed within the first layer, but also information about 
the old state of the leftmost h of those k sites, which is still 
needed for updating h left-over sites in the second layer. 
The encoder must be wide enough for this purpose. To 
show that a k large enough to provide the required extra 
capacity can always be found we shall be making use of 
Fekete's iemmapT] . which we restate here in a form similar 
to that used by van Lint and Wilson [40]. 

Lemma 4 (Fekete) Let g : N ^ R+ be a function 
for which g{i + j) < g{i)g{j) for all i,j e N. Then 
liuii^aD gi^y^^ exists and equals infi>i g{iy-^^ . 

Theorem 3 Any one-dimensional locally lossy cellular au- 
tomaton can he rewritten as an isomorphic lattice gas. 

Proof. Consider a one-dimensional CA of state alpha- 
bet A of cardinality a, neighborhood X of diameter h, and 
local map / : A^ A. 

Given a patch / and its closure /, for any state x ^ A' 
the local map / will determine the corresponding new state 
y — f{x) (with y G A^). Observe that, if the CA is not 
trivial, at least one state of / must have more than one pre- 
image among the states of /. On the other hand, there may 
be states of / that do not arise from any state of /; these 
are /'s orphans. By definition, a CA is locally lossy if for 
some i the corresponding patch / has orphans. 

Let us denote by g{i) the variety of the range of /, that 
is, the number of distinct states for / that actually arise 
from y = f{x) as x runs over /'s domain, A^ . Consider 
now two adjacent patches, / and J, spanning, respectively, 
i and j sites, and their juxtaposition I. J of length i -\- j. If 
the I portion of a pattern on IJ happens to be an orphan 
of /, then that entire pattern is an orphan for / J, immate- 
rially of whether or not the J portion of it is an orphan for 
J. It immediately follows that g{i + j) can be at most as 
large as g{i)g{j), and Fekete's lemma as given above applies 
verbatim. 

If we now express variety in a logarithmic fashion (see 
i j7.2p using a for a base — that is, if variety is represented by 
the quantity v{i) = \og^ g(i) — then the appropriate form of 
Fekete's lemma to be applied to this quantity is the sub- 
additive (rather than sub-multiplicative) one, that is, 

vii+j) ^ lim v{i)/i — inf w(i) — r/. 



Since the variety v{i) is in any case comprised between i 
(when the range of / is all of a*) and (when all of I's 
neighborhood states map into one and the same state for 
/), the above limit 77 is a number between and 1. 

For the construction of Fig. [TO] to work, the k lines of the 
codec (which collectively have a variety capacity of k) must 
be able to distinguish between all effectively occurring com- 
binations of values for the new state of k sites together with 
the old state of the adjacent h neighboring sites. Therefore, 
the joint variety k of all these combinations must not exceed 
the variety capacity k of the codec "channel." An upper 
bound to k is given by the sum of the marginal varieties, 
namely, h for the h neighbors and k' (to be determined 
below) for the new state of the k sites. 

If the CA is locally lossy, then v{i) is eventually less than 
i, and thus 77 < 1. In this case, given any desired variety h, 
and e > such that 77' = 77 -I- e < 1, for large enough k we 
have v{k)/k < rj' and k — v{k) > k{l — rj') > h. In other 
words, for any neighborhood diameter h we can always find 
a patch K of length k having a variety defect k — v{k) at 
least as large as h. A codec built on this patch will be able 
to transmit through k lines the variety of the new state of k 
sites together with that of the old state of h of these sites. 
■ 

We can apply Theor. [3] to Wolfram's so-called "rule 
110" 43J and conclude that 

Corollary 1 There exist noninvertible computation- 
universal cellular automata that can be isomorphically 
rewritten as lattice gases. 

Proof It is easy to verify that a codec of sufficient ca- 
pacity for the construction of Fig. [TO] to work — with h — 2 
and rule 110 as a local map — can be found as soon as fc > 17. 



9 A two-dimensional example 

Before attempting to extend the above result to an arbi- 
trary number of dimensions, it will be expedient to verify 
at the very least that the one-dimensional construction of 
§7.21 for a 2-input and cellular automaton, can be leveraged 
(with some creativity) to the analogous two-dimensional 
case, that is, a (2x2)-input and automaton. 

In the ID case of §7.21 our CA-to-LG construction yielded 
a 2-layer lattice gas that employed one codec to negotiate 
the information transfer between layers 1 and 2. The codec 
was four sites wide, and in either layer the block spanned 
by the LG node was five site wide — the 1 extra site corre- 
sponding to the neighborhood diameter, which was, in that 
case, 1. 

In the present 2D case, five layers (instead of merely two) 
will be required for the state of the lattice gas to advance 
through one time step of the cellular automaton. Four dif- 
ferent codecs will be needed to negotiate the information 
transfer between adjacent layers, i.e., 1— >2, 2-^3, 3— >4 
and 4^5. As for the size of the LG blocks, which is deter- 
mined by the size of the smallest codecs that will do the job. 



t=0 



lb 



'J 


















i 


® 


® • 


® 


® 


® 


® • 


® 


i 


!) 


® 


® • 


® 


® 


® 


® • 


® 


i 


!) 


® 


® • 


® 


® 


® 


® • 


® 


i 


!) 


® 


® • 


® 


® 


® 


® • 


® 


i 


) 


® 


® • 


® 


® 


® 


® • 


® 


i 


!) 


® 


® • 


® 


® 


® 


® • 


® 


i 






^=1 - 








^=1 







Id 























o 


o • 


® 


o 


o 


o • 


® 


( 




® 


® • 


® 


® 


® 


® • 


® 





) 


o 


o • 


® 


o 


o 


o • 


® 


( 


) 


o 


o • 


® 


o 


o 


o • 


® 


( 


) 


o 


o • 


® 


o 


o 


o • 


® 


( 


i) 


® 


® • 


® 


® 


® 


® • 


® 





^ 
















r 
























) 


o 


o 




® 


o 


o 


o 




® 


c 


















) 


® 


® 




® 


® 


® 


® 




® 


« 


) 


o 


o 




® 


o 


o 


o 




® 


c 


) 


o 


o 




® 


o 


o 


o 




® 


c 


) 


o 


o 




® 


o 


o 


o 




® 


c 


















!) 


® 


® 




® 


® 


® 


® 




® 


« 


1 























2b 



2c 



) o o 


■ V» 1^ 1^ 1^ 

• ® o o o 


■ vs/ 1. 

• ® c 


) ® ® 

) o o 
) o o 
) o o 


® o o o • 
® o o o • 
® o o o • 

® ® ® ® • 


• ® « 

• ® c 

• ® c 

• ® c 


) ® ® 


• ® ® ® ® 

^ rst 


• ® < 




) O O 


IS/ IS/ ^ ^ ^ 

® ® o o o 


® ® C 


) ® ® 

) O O 
) O O 
) O O 


® o o o • 
® o o o • 
® o o o • 

® ® ® ® • 
® ® ® ® • 


® ® C 
® ® C 
® ® C 
® ® C 


) ® ® 


® ® ® ® ® 


® ® C 



2d 



) o o 


o o o o o 


o o c 


) ® ® 

) o o 
) o o 
) o o 


® o o o • 
® o o o • 
® o o o • 
® o o o • 
® o o o • 


® ® < 

o o c 
o o c 
o o c 


) ® ® 


® ® ® ® ® 

r\ r\ r\ r\ r\ 


® ® < 



2c 

























) 


O 


o 


o 


o 


o 


o 


o 


o 


o 


c 








i) 


® 


® 


® 


® 


® 


® 


® 


® 


® 


c 


) 


O 


o 


o 


o 


o 


o 


o 


o 


o 


c 


) 


o 


o 


o 


o 


o 


o 


o 


o 


o 


c 


) 


o 


o 


o 


o 


o 


o 


o 


o 


o 


c 










® 


® 


® 


® 


® 


® 


® 


® 


® 


c 

























3b 



3c 



) o o 


o o o o o 


o o c 


) ® ® 

) o o 
) o o 
) o o 


• ® o o o 

• ® o o o 

• ® o o o 

• ® o o o 

• ® o o o 


® ® « 

o o c 
o o c 
o o c 


) ® ® 


® ® ® ® ® 

r\ 


® ® « 




) o o 


^ ^ ^ ^ ^ 

o o o o o 


o o c 


) ® • 

) ® ® 

) o o 
) o o 
) o o 


o o o ® ® 
o o o ® ® 
o o o ® ® 
o o o ® ® 
o o o ® • 


® ® c 
® ® c 

o o c 
o o c 
o o c 


) ® • 

) ® ® 


® ® ® ® . 

® ® ® ® ® 

r\ r\ r\ r\ r\ 


® ® c 
® ® c 



3d 



J 






















) 


o 


o 


o 


o 


o 


o 


o 


o 


o 


c 


) 


o 


• 


® 


o 


o 


o 


• 


® 


o 


c 


) 


o 


® 


® 


o 


o 


o 


® 


® 


o 


c 


) 


o 


o 


o 


o 


o 


o 


o 


o 


o 


c 


) 


o 


o 


o 


o 


o 


o 


o 


o 


o 


c 


) 


o 


o 


o 


o 


o 


o 


o 


o 


o 


c 


) 


o 


• 


® 


o 


o 


o 


• 


® 


o 


c 


) 


o 


® 


® 


o 


o 


o 


® 


® 


o 


c 

























) o o 



3e 



) O • 

) O..® 

o-- 

) o o 

) o o 

) o o 



) o • 

) o..® 
o-- 

!l d 1^ 



o o o o o 



o o o • 

o o o..® 
o- 

o o o o 
o o o o 
o o o o 



o o o • 

o o o..® 
o- 

^ 



o o c 



® o c 

® o c 

o o c 

o o c 

o o c 



® o c 
® o c 

j^^i Q t. 



4b 



4c 



4d 



) o 


^ ^ ^ ^ ^ 

o o o o o 


^ ^ ^ ^ 

o o o c 


) o 

o-- 
) o 

) o 

) o 


• ® o o o 

.* ® o o o 
o- 

o o o o o 
o o o o o 
o o o o o 


• ® o c 

.* ® o c 

o o o c 
o o o c 
o o o c 


) o 
o-- 

\ r\ 


• ® o o o 

.* ® o o o 

o--' 

r\ r-\ r-\ r-\ r-\ 


• ® o c 

.* ® o c 

r\ r\ r\ /■ 




J 

) o 


^ \^ 

o o o o o 


o o o c 


) o 

) o 
) o 
) o 


® ® o o o 
.* ® o o o 
o o o o o 
o o o o o 
o o o o o 


® ® o c 
.* ® o c 

o o o c 
o o o c 
o o o c 


) o 

\ r\ 


® ® o o o 

.* ® o o o 

o- 

r-\ r-\ r-\ r\ r\ 


® ® o c 
.* ® o c 

r\ r\ r\ I- 




J 

) o 


o o o o o 


K 

o o o c 


) o 

o-- 
) o 

) o 

) o 


o o o o o 

-O o o o o 
o- 

o o o o o 
o o o o o 
o o o o o 


o o o c 
-O o o c 
o o o c 
o o o c 
o o o c 


) o 
o-- 


o o o o o 

-O o o o o 
o- 

^ ^ ^ ^ ^ 


o o o c 
-O o o c 



4c 



















V, 


) o 


O 


o 


o 


o o 


o 


o 


o 


c 


) o 


o 


o 


o 


o o 


o 


o 


o 


c 


o-- 
) o 




o 


o 


o- 
o o 




o 


o 


c 


o 


o 


o 


o 


o 


o 


c 


) o 


o 


o 


o 


o o 


o 


o 


o 


c 


) o 


o 


o 


o 


o o 


o 


o 


o 


c 


) o 


o 


o 


o 


o o 


o 


o 


o 


c 


o- 




o 


o 






o 


o 


c 



i 




















v. 


) 


o 


o 




® 


O 


o 


o 




® 


c 
















) 


® 


® 




® 


® 


® 


® 




® 


i 


) 


o 


o 




® 


o 


o 


o 




® 


C 


) 


o 


o 




® 


o 


o 


o 




® 


c 


) 


o 


o 




® 


o 


o 


o 




® 


c 
















) 


® 


® 




® 


® 


® 


® 




® 


i 


^ 




















r 



3a 

























) 


o 


o 


o 


o 


o 


o 


o 


o 


o 


c 








!) 


® 


® 


® 


® 


® 


® 


® 


® 


® 




) 


o 


o 


o 


o 


o 


o 


o 


o 


o 


c 


) 


o 


o 


o 


o 


o 


o 


o 


o 


o 


c 


) 


o 


o 


o 


o 


o 


o 


o 


o 


o 


c 








!) 


® 


® 


® 


® 


® 


® 


® 


® 


® 
























r 



4a 





















) o 


o 


o 


o 


o o 


o 


o 


o 


c 


) o 


• 


® 


o 


o o 


• 


® 


o 


c 


Cr- 
) O 




® 


o 


Cr- 
O O 




® 


o 


c 


o 


o 


o 


o 


o 


o 


c 


) O 


o 


o 


o 


O O 


o 


o 


o 


c 


) O 


o 


o 


o 


O O 


o 


o 


o 


c 


) O 


• 


® 


o 


O O 


• 


® 


o 


c 


Cr- 




® 


o 


Cr- 




® 


o 


c 

















U O Iwl I O O O ^ I 



) o o 



5a 



) O O 

) Q.-C 
o- 

) o o 
) o o 
) o o 



) o o 



0-- 



o o o o o 



o o 
o o 
o o 
o o 
o o 



o o o 

O O.-C 
Cr- 

o o o 



o o 
o o 



o o o o 

o o o o. 
o- 

r\ r\ r\ r\ - 



o o c 



o o c 

o o c 

o o c 

o o c 

o o c 



o o c 
o o c 



The new state of a site, indicated by a circle, is a function of 
the current state of its four neighbors, coUectively indicated 
by four dots, according to the neighborhood template 
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Figure 11: Construction of a 5-layer lattice gas for the 2x2 AND cellular automaton. 



let us remark that the most critical passage is from layer 
1 to layer 2 (as we shall see, much less "spare capacity" is 
needed at the next three passages). Using the lossiest (the 
"most noninvertible" ) nontrivial CA rule on a 2x2 neigh- 



borhood, namely the and of the four neig hborsEI a codec 



^^This rule maps the 16 neighborhood states into 15 zeros and 1 one, 
while any surjective rule (and thus any invertible one) would need to 
map the 16 states into 8 zeros and 8 ones — see the "balanced map" 
criterion in |28) . 
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of size 4x4 will have enough capacity to pass the required 
edge-neighbor information from layer 1 to layer 20 There- 
fore, since the local map is of the format 2x2 1x1, and 
thus has a neighborhood diameter of 1 in either dimension, 
to host a codec of size 4x4 we shall need an LG block size 
of 5x5 (cf. the analog situation in the ID case, as in Fig. [5]). 

In Fig. [11] we describe the function computed by each 
layer in terms of a sequence of logical steps (4 for the first 
layer, 2 for the last, and 5 for each of the intervening three). 
We stress again that such a sequence gives a heuristics for 
identifying that function but is not meant to be a descrip- 
tion of an internal mechanism for it. Each layer will consist 
of LG nodes each simply taking its inputs from a block of 
5x5 sites and returning its 5x5 outputs to a block of the 
same size; there is no logical need to analyze this function 
as a "computation," i.e., as a series/parallel composition 
of simpler functions. What's more, in a physical imple- 
mentation of the function, in order to have the maximum 
thermodynamic efficiency (see fT2|) it may be essential to 
make use of a custom physical interaction that makes the 
5x5 signals interact with one another all at oncel What- 
ever the heuristics, in the end it may be more appropriate to 
visualize each node merely as an indivisible interaction — a 
mere lookup table. 

Layer 1 (b,c,d) 

In Fig. [Til which may be viewed literally as a "flipbook," 
we enter layer 1 immediately after t = 0; a, dot indicates 
that a site is in the "old" {t = 0) state. At step lb the sites 
are grouped into 5x5 blocks. At Ic, we compute the "new" 
state (the one that will be appropriate for t = 1), indicated 
by a circle, of all the sites that can view their entire 2x2 
neighborhood within that block; a dot within a circle means 
that we are at present in possession of the old as well as the 
new state for a given site. 

Next (Id) we ask ourselves which of the old states will 
still be needed for eventually computing the new state of 
the sites which we haven't been able to update yet; the 3x3 
old states which have exhausted their function may now be 
dropped from consideration. At le we recognize that, owing 
to the local noninvertibility of the and rule, the variety of 
the array of 4x4=16 new states is less than 16 bits. There- 
fore, the 16 lines corresponding to those sites have enough 
"channel capacity" to carry some extra information, specif- 
ically, the 7 old states which we are still carrying over. We 
thus construct, in analogy with what we did in Fig.[9l an en- 
coder (gray area) that compresses those 16-f 7 binary lines 
into just 16 lines which are passed on to the next layer. The 
9 not-yet-updated sites that make the unshaded border are 
passed on verbatim to the next stage. 

Layer 2 (a,b,c,d,e) 

We now shift the block boundaries one site leftwards. 
At 2a the 16 encoded lines are picked up by a matching 
decoder which expands them into the original 16-1-7 lines 
(step 2b). At 2c we update as many new additional sites 

^^With 16 binary lines, this 4x4 codec has a capacity for 2^^ = 
65, 536 different messages, which is more than enough for the 52,886 
different messages (combinations of new and old site values, as ex- 
plained in point 2 near the end of ^7.21 1 which we need to send through 
the codec from layer 1 to layer 2. 



as possible on the basis of old states that have just been 
made accessible within the block by shifting its boundaries; 
and at 2d we discard the old states for which we have no 
further need. Finally (step 2e) we prepare an encoder for 
squeezing through to the next level the 5 old states that still 
accompany their new states. Note that now we have more 
channel capacity available (deriving from a 5x4 rather than 
a 4x4 array of new states) for fewer lines to squeeze through 
(5 instead of 7); in other words, things become easier once 
cleared the information bottleneck of the first layer. 

Layer 3 (a,b,c,d,e) 

Block boundaries are now shifted one site upwards. Layer 
3 proceeds much like layer 2, with just one novelty. At the 
moment (step 3e) of packing 20 new states plus 2 old states 
into a 4x5 encoder, we realize that one site outside of the 
nominal span of the encoder has both old and new state 
with it, while its one line, to be carried over verbatim to 
the next layer, will have no room for both. What we do 
then is let the old state of that site go through in place, but 
"tuck away" into the encoder the new state, as indicated 
by the dashed hue. 

Layer 4 (a,b,c,d,e) 

Block boundaries are shifted one site leftwards. Every- 
thing proceeds like in layers 3 and 4. 

Layer 5 (a,b) 

We shift block boundaries rightwards, i.e., back were they 
were at the end of layer 3. We unpack everything (step 5a) 
from the decoder, including the new state that had been 
tucked away at step 3e, and at 5b we return this state to its 
original site. Now {t — 1) that there are no more old states 
left we are ready to rechristen the new states 'old states' 
and start a new 5-layer cycle. 

As long as the neighborhood remains 2x2, it is clear that 
the above construction pattern will work for a larger state 
alphabet and an arbitrary local map[il just as in fJHJ all 
that will be required is a large enough fcxfc block size. 

10 General case 

The 2D construction of fj9] can apparently be generalized 
to an arbitrary neighborhood size; that would make our 
2D result as general as the ID one. One may then wonder 
whether an analogous result holds for any number of dimen- 
sions. In support of that thesis, we'd like to observe that, 
as long as one retains the construction strategy implied by 
Eland gH 

1. The strategy does not depend on the size a of the state 
alphabet; the latter only influences the parameter k — 
the size of the codec. 

2. The strategy does not depend on the diameter h of the 
neighborhood; the latter only determines the thickness 
of the "boundary layer" of neighbors whose old state 
has to be carried through until no longer needed. 

Though, of course, still one that yields a locally lossy CA. 
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3. The strategy does not depend on the specific local map 
/; the latter, assumed to be locally lossy, only in- 
fluences, through the extent of the information losses 
(gaged by the Fekete's lemma's limit 77 of Theor. [3] or 
its multidimensional counterpart as in Lemma O, the 
size of the codec. 

4. The existence of codecs of appropriate sizes for the dif- 
ferent stages is guaranteed by the multivariate version 
of Fekete's lemma (below). 

Lemma 5 (multivariate Fekete's lemma; Capobianco'7') 
Let / : N'' — > R"*" be a function of d variables, subadditive 
in each, i.e., for any j 

f{xi, ...,Xj +yj,...,Xd) < f{xi, ...,Xj,...,Xd) + f{xi, ...,yj, ...,Xd). 
Let 77 = infa;j,...,:rd>i /{xi, . . .,Xd)/xi ■■ - Xd. For any e > 
there exist ki, . . . ,kd such that, if Xj > kj for all j, then 
fixi,. . .,Xd)/xi ■ ■ ■ Xd < t] + e. 

Therefore, each of the stages, analogous to those of ^ 
that may be needed for the d-dimensional case, is feasible 
for a large enough block size (ni, 712, . . . , n^). All that is 
missing at this point (but this may well turn out to be a tall 
order) is the guarantee that, for any d, an appropriate finite 
sequence of "progressive partial updating" layers may be 
constructed so as to ultimately achieve, in a finite number 
of steps, a complete updating of the CA. 

A bit of common sense will remind one that 

• Just by looking at the increase in logical and graph- 
ical construction complexity in going from ID to 2D, 
one may want an automated proof — or even an au- 
tomated proof generator — to tackle the complexity of 
the sequence of steps needed for more dimensions and 
organize it in a comprehensible way. For instance, (a) 
Will the number of layers be essentially proportional to 
the number of dimensions, and thus to the number of 
(hyper)faces of a block? (b) Will one also need extra 
layers to account for (hyper)edges, . . . , and so forth 
down to the 0-dimcnsional vertices? 

• Numerical verification of representative test cases is 
invaluable in complicated proofs. ("Pure" mathe- 
maticians may argue otherwise, but we suspect that, 
far from shunning such verifications, they actually do 
them automatically and subconsciously.) However, the 
computational burden of such tests grows exponen- 
tially (and extremely fast if one takes block side as 
a parameter), to the point that, even for Conway's 
"Game of Life" p3] — a moderately lossy 2-state, 3x3- 
neighborhood cellular automaton — the effective con- 
struction of the codecs is a well-nigh intractable task! 

• There are encouraging and discouraging precedents in 
this area. As we already mentioned at the beginning 
of ^ a related conjecture, which had taken six years 
to prove for one and two dimensions, took another six 
years and a very complex construction to prove in the 
general case. On the other hand, in another argument 
entailing, again, surjectivity and injectivity in cellular 
automata, the authors of 1 had thought that their 
techniques were "in principle extendable to arrays of 



higher dimension;" since, however, these techniques 
were "difficult to manage beyond dimension one," they 
expected that "generalizations of their results to higher 
dimensions" would "most likely require a different ap- 
proach." In the end (that is, almost twenty years later), 
the conjecture turned out to be wrong and its object 
undecidable[T8]! 

The evidence given, together with more work in progress 
on actual multi-dimensional exhibits, is strongly suggestive 
but not complete. For the moment, we prefer to propose 
the general case only as a conjecture, namely. 

Conjecture 1 Any locally lossy cellular automaton can be 
rewritten as an isomorphic lattice gas. 

In any event, it is remarkable how, in spite of classes C"*" 
and being at opposite extremes in the categorical clas- 
sification ll]), the construction strategy utilized here has 
strong analogies (blocks, layers, store-and-forward of infor- 
mation with repeated exchange through block edges) with 
those used by Kari and Durand-Lose. This in spite of the 
fact that the latter critically relies on a OA's being strictly 
information lossless while ours critically depends on their 
being locally lossyl 

11 Some consequences of the conjecture 

From the truth of Conjecture [1] would derive the following 
remarkable consequences. 

Proposition 1 All cellular automata, except the vanishing 
subset C° , can be rewritten as isomorphic lattice gases. 

Proposition 2 Regular cellular automata (see end of ^ are 
exactly those which can be rewritten as isomorphic lattice 
gases; singular cellular automata, those which cannot. 

12 Thermodynamic considerations 

Here we shall briefly touch on some thermodynamic aspects 
of the foregoing purely mathematical results. Though we 
could have phrased the following purely in terms of entropy, 
without ever having to embody it into "energy" and "heat," 
there will be no harm in using the more familiar, though 
less general, picture. 

As mentioned right before Theorem [TJ a concrete phys- 
ical system patterned after the "schematics" of a cellular 
automaton will require a steady supply of power to oper- 
ate and a thermal sink capable of steadily dissipating this 
power — no matter whether or not the underlying abstract 
dynamical system is invertible — to support the "document 
duplicating services" continually performed by the fanout 
nodes and the "surplus document shredding" performed by 
the many-input, one-output logic gates. 

Speaking here, for simplicity, as if our conjecture were 
true (if not, our arguments will apply at least to the cases we 
have proved) , for regular cellular automata it is in principle 
possible to build an alternate implementation, in the form 
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of a lattice gas that does not require a power supply to 
operate^ 

For those cellular automata that are invertible, besides no 
power supply, no thermal sink will be needed either, since, 
with Kari's and Durand-Lose's construction, large enough 
lattice-gas blocks allow one to make use of all relevant cor- 
relations and thus operate in an efficient, dissipationless 
fashion. The system will work analogously to a planetary 
system — a complex nonlinear svsteni^^l that can ideally run 
forever on its own initial energy, without requiring an ex- 
ternal power source. 

On the other hand, a system like Conway's "Game of 
Life" is intrinsically noninvertible, and so to implement it 
within our world's physics one will have to complement it 
with an ancillary "entropy drain" that is not explicitly rep- 
resented within the cellular-automaton model. The com- 
bined physical system (cellular automaton plus ancilla) is 
one that exhibits friction and gradually converts "high- 
grade" energy into heat. However, our construction shows 
that, much as in the invertible case, this convertion need 
not entail more than the system's initial store of energy — 
no continual power injection from outside is needed for the 
system's operation. Consequently, throughout the system's 
inBnite course of evolution no more entropy will have to be 
drained from it than the information originally contained 
in it at time t = 0, which is at most log a per site. This 
behavior is like that of an isolated inertial planetary system 
in which some of the interactions are affected by friction. 

As a dissipative system (planetary or "Game of Life" -like) 
evolves and its trajectories gradually merge, the system be- 
comes effectively closer and closer to being invertible!^ and 
the attendant heat production per time step decreases. The 
heat integral over the entire infinite trajectory from a given 
initial state can never exceed the energy contents of the ini- 
tial state. In this sense, the system's state simply "flows 
downhill" by itself, rapidly and first but increasingly more 
slowly, until its mostly settles into local equilibria or local 
attractor states, which are essentially nondissipative. Flick- 
ers of dissipative nontrivial computation must still occasion- 
ally occur (if the system is to be computation- universal), 
but ever more rarely and sparsely (cf. Dyson |^). 

The perhaps surprising, but not at all paradoxical, con- 
clusion is that a complex CA-like dynamics can run forever, 
faithfully applying its local map at each site and each time 
step, fueled, as it were, just by the negentropy of the input 
data, i.e., its own initial state. 

A physically plausible scheme for CA-like digital compu- 
tation fueled by just its input data had been proposed by 
Lent, Tougaw, and Porod[22], ostensibly realized by simple 
interactions between chains of bistable one-electron quan- 
tum boxes. There was some debate at that time as to 
whether such a scheme could actually work as purported — 

^■^More precisely, even if some form of friction should be unavoidable 
because of the nonideal behavior of the implementing mechanism, yet 
there is no theoretical lower bound set by thermodynamics to the 
amount of this friction, and thus to the amount of power that will 
have to be supplied to overcome it. 

^^Even capable of general-purpose computation, as shown in a styl- 
ized way by Margolus's "billiard-ball" model of computation 27 . 

^^For instance, a Game-of-Life "semaphore" or a "glider," taken in 
isolation, may be viewed as obeying an invertible dynamics. 



though objections mostly reduced to "the outcome seems 
too good to be true." The specific interactions sketched at 
that time may well have been too weak to support the de- 
sired behavior; nevertheless, our results show that a scheme 
like theirs is in principle feasible, though requiring the de- 
ployment, for "primitive computing elements," of rather 
complex, ad hoc physical effects. 

13 Conclusions and perspectives 

We have presented and discussed the domain of validity of 
Proposition [21 (a) proving that it applies to all ID cellular 
automata and some 2D ones, (b) giving substantial evidence 
in favor of its applying to any number of dimensions, and 
(c) outlining a plausible path to completing that evidence. 
In any event, just results (a) above are sufficient to conclude 
that 

• There exist nontrivial noninvertible cellular automata 
that can be rewritten as isomorphic lattice gases. 
These include cellular automata that are computation- 
universal. 

• Much as in the case of invertible cellular automata, 
the conversion of a noninvertible cellular automaton 
into an equivalent lattice gas entails using LG unit 
cells that span a possibly very large number of CA 
sites. The resulting LG, then, while displaying the 
same functional pitch as the original CA, must make 
use of a much coarser structural pitch — and thus of a 
much more complex local mechanism. 

• For a given computing task, the thermodynamic effi- 
ciency of a lattice gas, which unlike a CA does not 
indulge in wasteful "photocopying" and "shredding" 
practices, can only be achieved at the cost of using LG 
nodes that encompass a sufficiently large number n of 
sites. Intuitively, such a large node is able to match 
"offer" and "demand" for specific pieces of informa- 
tion between faraway CA sites, routing information to 
successive destinations as needed ("information recy- 
cling") and thus eliminating duplication and waste. 

• The required span n of the nodes of an equivalent LG 
(i.e., one capable of that efficiency) is not a computable 
function of the CA's local map /. Thus n(/) is an 
unbounded function: given an arbitrarily large n, there 
exist CA that cannot be converted into LG with a span 
n less than n sites! 

In S|3]we mentioned the use of finite-difference schemes for 
the numerical integration of partial differential equations. 
In many cases, a number of alternative schemes have been 
proposed for dealing with the same differential equation; 
of these, some may be much more complex than others. 
Why should one go to such lengths? It turns out that the 
extra cost of a certain scheme may be offset by a special 
benefit, such as improved convergence rate, better stability 
with respect to small changes in initial conditions, or strict 
conservation of quantities that are conserved in the tar- 
get differential equation. Even if each "turn of the crank" 
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should more expensive, a scheme may still be competitive 
if many fewer "turns" are needed for the same result. Or 
there may be schemes that offer different tradeoffs in the 
space and time required for a computation (see Bennett [5] 
and Li and Vitanyi[23J), and from among which one would 
choose depending on the relative "costing" of storage and 
processing resources. 

In this paper we have introduced one new type of com- 
putational scheme tradeoff, that between the one-off cost of 
an efhcient but complex infrastructure and the daily drain 
of energy (or entropy) needed to make a simpler, cheaper 
infrastructure do the same job. The spacetime regularity 
of a CA's dynamics explicitly forces one to obey the golden 
rule, "do unto others as you would have them do unto you." 
That is, to improve the thermodynamic efhciency of its own 
computational task, one CA site cannot just borrow some of 
its right neighbor's state as a scratchpad, lest its left neigh- 
bor claim some of his state — a vicious infinite regress. But 
the sresources of many CA sites can be pooled together 
into an LG "supersite." Our results and conjecture deal 
with how, for any regular CA, a large enough LG supersite 
can be constructed that achieves perfect thermodynamic 
efRciency. To paraphrase [lOj . we may term this approach 
"logic for capitahsts." 

Having dispensed with a power supply for locally lossy 
CA, if the resulting LG system has to be embedded within 
an invertible physics one must still complement it with a 
heat sink. Our construction leaves open the issue of how to 
realize such a sink while retaining the local finiteness of an 
LG scheme. Can removing and sequestering away a finite 
amount of heat per site be achieved by a finite amount (per 
site) of extra storage and processing resources by an LG 
mechanism — and thus by local means? This is a wide open 
area for research. 

Finally, there is a rather delicate question that we can 
barely mention here. We recall that computation is the 
art of putting together — even though in as large a number 
of copies as desired — a finite set of logic primitives given 
once and for all. Within any LG this condition is clearly 
satisfied, as in an LG all processing is done by composition 
of instances of a single unit cell, complex as this may be. 
But to what extent can one accept as bona fide computation 
an approach that requires designing, for each CA that one 
wants to rewrite as an LG, new custom primitives, that is, 
the nodes that make up the LG's very unit cell? Note that 
these may be arbitrary n-input, n-output boolean functions. 

Among the physical "effects" actually available in nature 
one may well hope to find one that can be exploited for re- 
alizing a simple logic primitive such as the and gate (or an 
invertible counterpart thereof, such as the so called "Toffoli 
gate" [35l|4]). But to what extent will we be able to locate in 
nature "custom Hamiltonians" capable of realizing (without 
power assist) arbitrarily complex rt-body interactions? In- 
tuitively, information recycling can make computation sub- 
stantially more efficient (cf. lOj). But, at least within the 
stylized context of CA and LG, we've touched with hand 
reasons why recycling schemes may become impractical well 
before one attains perfect thermodynamic efficiency. 
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